Picture for Manoj Karkee

Manoj Karkee

Center for Precision and Automated Agricultural Systems, Washington State University

Vision-Language-Action Models: Concepts, Progress, Applications and Challenges

Add code
May 07, 2025
Viaarxiv icon

Design, Integration, and Evaluation of a Dual-Arm Robotic System for High Throughput Tissue Sampling from Potato Tubers

Add code
May 01, 2025
Viaarxiv icon

Plant Disease Detection through Multimodal Large Language Models and Convolutional Neural Networks

Add code
Apr 29, 2025
Viaarxiv icon

A Review of 3D Object Detection with Vision-Language Models

Add code
Apr 25, 2025
Viaarxiv icon

RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity

Add code
Apr 17, 2025
Viaarxiv icon

Adaptive Vision-Guided Robotic Arm Control for Precision Pruning in Dynamic Orchard Environments

Add code
Apr 09, 2025
Viaarxiv icon

Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models

Add code
Feb 21, 2025
Viaarxiv icon

Image, Text, and Speech Data Augmentation using Multimodal LLMs for Deep Learning: A Survey

Add code
Jan 29, 2025
Viaarxiv icon

Integrating YOLO11 and Convolution Block Attention Module for Multi-Season Segmentation of Tree Trunks and Branches in Commercial Apple Orchards

Add code
Dec 07, 2024
Viaarxiv icon

Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development

Add code
Nov 18, 2024
Viaarxiv icon